Embedding Conditional Event Algebras Into 
Temporal Calculus of Conditionals 



Jerzy Tyszkiewicz 1 ' 
Achim Hoffmann 2 
Arthur Ramer 2 



February 1, 2008 



Institute of Informatics, 
Warsaw University, 

Banacha 2, 
02-097 Warszawa, 
Poland. 
E-mail jty@mimuw.edu.pl. 
Supported by the Polish Research Council KBN grant 8 T11C 027 16. 

2 School CSE, 

UNSW, 
2052 Sydney, 
Australia. 

E-mail { j ty | achim | ramer}@cse . unsw . edu . au. 

Supported by the Australian Research Council ARC grant A 49800112 

(1998-2000). 



Abstract 



In this paper we prove that all the existing conditional event al- 
gebras (abbreviated cea in this paper) embed into the three-valued 
extension (TL|TL) of temporal logic of discrete past time, which the 
authors of this paper have proposed in [TRH01| as a general model of 
conditional events. 

First of all, we discuss the descriptive incompleteness of the cea's. 
In this direction, we show that some important notions, like indepen- 
dence of conditional events, cannot be properly addressed in the cea 
framework, while they can be precisely formulated and analyzed in the 
(TL|TL) setting. 

We also demonstrate that the embeddings allow one to use the na- 
tive (TL|TL) algorithms for computing probabilities of complex con- 
ditional expressions of the embedded cea's, and that these algorithms 
can outperform those previously known. 
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1 Preliminaries and statement of the problem 



1.1 The problem of conditional objects 



Probabilistic reasoning [Pea88] is the basis of Bayesian methods of expert 
system inferences, of knowledge discovery in databases, and in several other 
domains of computer, information, and decision sciences. The model of 
conditioning and conditional objects we discuss serves equally to reason 
about probabilities over a finite domain X, or probabilistic propositional 
logic with a finite set of atomic formulae. 

Computing of conditional probabilities of the form Pr(X|Yi, . . . , Y n ) and, by 
extension of conditional beliefs, is well understood. Attempts of defining first 
the conditional objects of the basic form X\Y, and then defining Pr(X|y) as 
Pr((X|y)) were proposed, without much success, by some of the founders 



of probability [Boo57, dF72|. They were taken up systematically only about 
1980 ||Ada86| , |Lew76| , FS9l |GGNR91| , |GN95| . The development was slow, 
both because of logical difficulties — interpretation of conditionals, and even 
more because the computational model is difficult to construct. (While a\b 
appears to stand for a sentence 'if b then a' and the probability is Pr(a|6) = 
Pr(a A 6)/Pr(6), there is no obvious calculation for Pr(a|(6|c)), nor intuitive 
meaning for a|(6|c), (a|6) A (c\d), and the like.) 

The idea of defining conditional objects was entertained by some founders of 
modern probability [Boo57, dF72(| , but generally abandoned since introduc- 
tion of the measure-theoretic model. It was revived mostly by philosophers 
in 1970's |Ada86 , vF77] with a view towards artificial intelligence reasoning. 



Formal computational models came in the late 1980's [Cal87, GNW91] with 
only one, based on formal fractions and three-valued indicator functions, 
used for few actual calculations of conditionals and their probabilities. That 



model may give results whose values are open to questions | Cal94 ]. 
The authors of this paper have developed in the companion paper [TRH01] 
a temporal calculus (TL|TL) of conditionals, based on the early ideas of de 
Finetti fdF7|. 

In the present paper we show that all the major previously existing systems 



of conditionals, the so called conditional event algebras (see [GMN97] or Sect. 



4.2 in the current paper), embed isomorphically into (TL|TL). Looking at 
them as fragments of (TL|TL), we demonstrate their insufficient expressive 
power and other defects in their construction. 

They attempt to provide certain kind of a model of the logic of conditional 
expressions, built up from simple conditionals of the form (a\b) with the 
connectives: conjunction, disjunction and complementation. 
However, the semantical objects assigned to the expressions are not required 
to be of probabilistic nature, so they fail to provide methods to verify the 
chosen structure experimentally. 

The structure of conditionals is not determined functorially by the space of 



2 



nonconditional events. 

Moreover, very restricted setting of cea's does not allow one to address many 
important questions, like stochastic independence of complex conditionals. 
In the (TL|TL) setting independence can be precisely defined and analyzed, 
unlike in the cea formalism, (cf. Theorem [29| below). 

Finally, we use the algorithms for calculating probabilities in (TL|TL), which 
stem from the highly developed algorithms for calculating limiting probabil- 
ities in Markov chains, and apply them to the embedded cea's. It appears 
that these algorithms clearly outperform the previously known ones for the 
important product space cea [Goo94|. 

Consequently, we believe that our (TL|TL) can be used as a single alterna- 
tive to each of the major cea's considered in the literature so far, superior to 
each of them, in the sense of expressive power, clarity of logical and seman- 
tical structure, and, last but not least, availability of efficient algorithms. 



2 The tools 

2.1 (TL|TL), Moore machines and Markov chains 

We describe briefly the construction of temporal conditionals, presented in 
detail in [fTRHOlfl . 

Let £ = {a, b, c, d, . . . } be a finite set of basic events, and let E be the 
Boolean algebra generated by £, and ft the set of atoms of S. Consequently, 
£ is isomorphic to the powerset of O, and £1 is isomorphic to the powerset 
of £. Any element of £ will be considered as an event, and, in particular, 
£ C S. 

The union, intersection and complementation in X are denoted by aUb, adb 
and a^, respectively. The least and greatest elements of £ are denoted 
and respectively. However, sometimes we use a more compact notation, 
replacing n by a juxtaposition. When we turn to logic, it is customary to 
use yet another notation: a V b, a A b and ~^a, respectively. In this situation 
$7 appears as true and as false, but 1 and 0, respectively, are incidentally 
used, as well. 

3 = {0,1, _L} is the set of truth values, interpreted as true, false and un- 
defined, respectively. The subset of 3 consisting of and 1 will be denoted 
2. 



Let us first define temporal logic of linear discrete past time, called TL. 
The formulas are built up from the set £ (the same set of basic events as 
before), interpreted as propositional variables here, and are closed under the 
following formula formation rules: 



1. Every a £ £ is a formula of temporal logic. 
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2. If tp, tp G TL, then their boolean combinations tp V tp -up are in TL. 
The other Boolean connectives: A, — <->, . . . can be defined in terms 
of -i and V, as usual. 

3. If <p, tp G TL, then their past tense temporal combinations • tp and 
tp Since tp are in TL, where • tp is spelled "previously 93." 

A model of temporal logic is a sequence JVC = so, si, . . . , s n of states, each 
state being a function from £ (the same set of basic events as before) to the 
boolean values {0, 1}. Note that a state can be therefore understood as an 
atomic event from Q, and M can be thought of as a word from Q + . The 
states of JVC are ordered by <, and s + 1 denotes the successor state of s. We 
adopt the convention that, unless explicitly indicated otherwise, a model is 
always of length n + 1, and thus n is always the last state of a model. 
For every state s of M we define inductively what it means that a formula 
tp G TL is satisfied in the state s of JVC, symbolically JVC, s\= tp. 

1. M,s \= a iff s(a) = 1 
2. 

M, s |= -xp : <^=>- JVC, s ^ V 3 ; 
JVC, s |= tp V V : M,s|=(/3orM,s|=i/>. 



JVC, s |= • tp : 
JVC, s |= 9? Since if) : 



s > and JVC, s — 1 |= tp; 

(3t < s)(M,t \= if) and (Vt < w < s)M,w \= tp). 



The syntactic abbreviations ■ tp and ♦ tp are of common use in TL. They 
are defined by ♦ tp = trueS\ncetp and Mtp = -\^—\tp. The first of them is 
spelled "once 93" and the latter "always in the past tp" . 
Their semantics is then equivalent to 

JVC, s \= M tp <^ (it < s)JVC, t \= tp; 
M, s \= ♦ tp <^ (3t < s)JVC, t \= tp. 



Theorem 1 (see [ Eme90| ) . The set of valid TL formulas is complete in 



yS5Vl6£. The set of valid TL formulas with ■ and ♦ as the only temporal 
connectives is complete in coJMTP. 

(TL|TL) is the logic of formulas of the form {<p\ip), where tp, ip G TL. (TL|TL) 
is a 3-valued extension of TL, and {tp\ip) is 



1). true in JVC, n iff JVC, n \= tp A tp. 



4 



0). false in M, n iff M, n \= -up A V- 
X). undefined in M, n iff M, n \= -tip. 

A 3-valued Moore machine 21 is a five-tuple 21 = (Q, f2, <5, /i, qo), where where 
Q is its set of states, f2 (the same set of atomic events as before) is the input 
alphabet, qo £ Q is the initial state and 5 : Q x 17 — > Q is the transition 
function, and /i is the output function Q — ► 3. 

Formally, to describe the computation of 21 we extend 5 to a function 5 : 
Q x r2 + — ► Q in the following way: 

x, x J^O?,™) if \w\ = l 

o{q,w) = < 

\5(5(q,v),io) if w = vuj. 

21 computes a function /gj : J7 + — > 3 + defined by 



f%{uJiio 2 ...uj n )= h(S(q ,u 1 ))h(6(qo, wiw 2 )) • • • /i(<5(<? , wi^ 2 • 
(note that |/a(wiW2 • • - ^n)\ = n ) as desired) 

We picture 21 as a labeled directed graph, whose vertices are elements of Q, 
labeled by their values under h, the function 5 is represented by directed 
edges labeled by elements of the edge labeled by to € £1 from q € Q leads 
to 5(q,u>). The initial state is typically indicated by an unlabeled edge "from 
nowhere" to this state. 

As the letters of the input word w € $7 + come in one after another, we walk 
in the graph, always choosing the edge labeled by the letter we receive. At 
each step it reports to the outside world the value h(q) of the state q in 
which it is at the moment. 

Drawing Moore machines, we almost always make certain graphical simpli- 
fication: we merge all the transitions joining the same pair of states into 
a single transition, labeled by the union (evaluated in S) of all the labels. 
Sometimes we go even farther and drop the label altogether from one transi- 
tion, which means that all the remaining input letters follow this transition. 
It is known for deterministic finite automata [HU79|, and extends easily 



to Moore machines with the same proof, that for any such device there 
is a unique (up to isomorphism) minimal (with respect to the number of 
states) device of the same kind, which accepts the same language (computes 
the same function, respectively). Moreover, this minimal device can be 
obtained from any such device as a quotient automaton/machine, i.e., by 
dividing the state space by some equivalence relation. For details, including 
a very efficient algorithm to perform minimization, see [HU7S]. 
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Definition 2. A Moore machine 21 is called counter-free if there is no 
word w G VL + and no states qi, q%, . . . , q s , s > 1, such that 5(qi,w) = 
q 2 , ■ ■ .,5(q s -i,w) = q s J(q s ,w) = q\. 

Sometimes we use an extension of Moore machines — Moore machines with e- 
moves. In a deterministic finite automaton an e-move is a transition between 
two states done without intervention of any letter from the input. By neces- 
sity, to maintain the deterministic character of the automaton, an e-move 
must not be combined with any other transitions starting from the same 
state. For Moore machines, we adopt the convention that after performing 
an e-move, no symbol is appended to the output. 
E.g., for the Moore machine with e-moves 21 below 




Q 



we have /a(www) = 0_L0 and f%{ojQuj) = 01_L. 

It is known that any function computable by a Moore machine with e-moves 
can be computed by a Moore machine without e-moves. Thus using e-moves 
we do not achieve greater generality. However, some transformations of the 
machines can be very conveniently represented by introducing e-moves. 

For us, Markov chains are a synonym of Markov chains with stationary 
transitions and finite state space. 

Formally, given a finite set / of states and a fixed function p:/x/-> [0, 1] 
satisfying (Vi E I) YljeiP(.hj) = -*-> the Markov chain with state space 
/ and transitions p is a sequence X = Xq, Xi, ... of random variables X n : 
W — > /, such that 

Pv(X n+1 =j\X n = i)=p(i,j). (1) 

The standard result of probability theory is that there exists a probability 
triple (W, 971, Pr) and a sequence X such that (|l|) is satisfied. W is indeed 
the space of infinite sequences of ordered pairs of elements from /, and Pr 
is a certain product measure on this set. 

One can arrange the values p(i,j) in a matrix II = (p(i,j);i,j G J). Of 
course, p(i,j) > and Y^jei P(h j) = 1 f° r everv i- Every real square matrix 
II satisfying these conditions is called stochastic. 

The initial distribution of X is that of Xq, which can be conveniently rep- 
resented by a vector Ho = (p(i)',i £ I). Its choice is independent from the 
function p(i,j). 
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It follows by a simple calculation that 



Pr(X n+ i = j\X n = i„,X n _i = £ n _i, ••• ,X X = i\) = Pi(X n+1 = j\X n = i), 

(2) 

which is called the Markov property. 

For our purposes, it is convenient to imagine the Markov chain X in another, 
equivalent form: Let Kj be the complete directed graph on the vertex set /. 
First we randomly choose the starting vertex in /, according to the initial 
distribution. Next, we start walking in Kj; at each step, if we are in the 
vertex i, we choose the edge to follow with probability p(i,j). If we 

define X n = (the vertex in which we are after n steps), then X n is indeed 
the same X n as in ([[]). 

So we will be able to draw Markov chains. Doing so, we will often omit 
edges with p(i, j) = 0. 

2.2 Conditional objects and conditional events 

Let for (ip\Tp) G (TL|TL) the function c = cr^) : + — > 3 be defined by 



c(w) 



1 if (if\ip) is true in (w,n) 
if (if\ip) is false in (w, n) 
A. if (<y?|V') is undefined in (w, n) 



C is the set of all functions c 
the set of all functions c+ : ST 
Moore machines. 

C and C_|_ are isomorphic under the mapping C + 3 c 



> 3 definable in (TL|TL), and C + is 
computable by counter-free 3-valued 



c G C defined by 



c(w) = last-letter-of(c + (u;)). 

The sets C and C+ are regarded as two representations of conditional objects. 
We have yet another representation, denoted 6^: it consists of functions 
Q°° — > 3°°, and Cqo, the third representation of the same conditional, is an 
infinite sequence of values of c on all finite nonempty prefixes of w. 
We will be using the name conditional events to refer to conditionals con- 
sidered with a probability space in the background. 



Definition 3 (Conditional event [ TRHOljl ). Let c € C be a conditional 
object over Q and let 21 be any counter- free Moore machine with output 
function h, computing c + . Suppose is endowed with a probability space 
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structure (Q, 9Jt, Pr). The conditional object c becomes then a sequence [c] = 
[c]i, [c]2, • • • of random variables [c] n : 17°° — > 3, defined by the formula 



[c] n (u>) = c(prefix-of-length-n-of(u;)), (3) 

where fi 00 is considered with the product probability structure. 
We call [c] the conditional event associated with c. 

Moreover, in presence of probability space structure 21 becomes a Markov 
chain X(2l) (by replacing labels of th e transitions by their probabilities 
under Pr in the diagram of 21), and then [c] = h(X), where h is the output 
function of 21. 

In particular, Pr([c] n = 1) is the probability that at time n the conditional 
object c is true, Pr([c] n = 0) is the probability that at time n the conditional 
object c is false, and Pr([c] = _L) is the probability that at time n the 
conditional object c is undefined. 

Definition 4 (asymptotic probability [fTRHOlfl ). We define the asymp- 
totic probability at time n of a conditional event c € C by the formula 



Pr (c) ~ = 1] (4) 

Flnlcj -Pr(ic1 n = 0orl)- W 



If the denominator is 0, Pr n (c) is undefined. 
The asymptotic probability of c is 



Pr(c) = lim Pr n (c), (5) 

n— >oo 

provided that Pr n (c) is defined for all sufficiently large n and the limit exists. 
If ip E TL then we write Pr(<£>) for Pr((ip\true)). 



Theorem 5 (Bayes' Formula | TRH0l]| ). Let (y|V0 ^ e a conditional ob- 
ject over $7 endowed with a probability space structure ($1,971, Pr), and let 21 
be any counter-free Moore machine with output function h, computing ((p\4>)- 

1. For every state i o/X(2l) the probability lim n _ >00 Pr(X n = i) exists. 

2. For every * G 3 the probability linin^oo Pr([(</?|i/>)] n = *) exists. 

3. The Bayes ' Formula 



holds whenever the right-hand-side above is well-defined, i.e., Pr(^) > 0. 
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3 Connectives of conditionals 

If we wish to extend the classical two- valued conjunction to conditionals, 
we are faced with the problem of synchronization. Indeed, what is easy 
in 2-valued world becomes messy in 3-valued world. The problem is that 
the conditionals need not become defined synchronously. For the classical 
conjunction this problem does not exist, because both arguments are always 
defined. Now we have to resolve the question how to define the conjunction, 
when some of the arguments are undefined. 

3.1 Present tense connectives 

Let us recall that present tense connectives are those, whose definition in 
(TL|TL) does not use temporal connectives, and therefore depends on the 
present, only. Equivalently, an n-ary present tense connective is completely 
characterized by a function 3 n — > 3. 

Here are several possible choices for the conjunction, which is always defined 
as a pointwise application of the following 3 valued functions. Above we 
display the notation for the corresponding kind of conjunction. 



X A S AC y 


x\y 


1 JL 








1 


1 1 


_L 


1 J_ 



x a G nw y 


x\y 


1 _L 








1 


1 1 


_L 


J_ J_ 



x A S ch y 


x\y 


1 


_L 








_L 


1 


1 


_L 


_L 


_L _L 


_L 





X 


->x 





1 


1 





_L 


_L 



x v S ac y 


x\y 


1 


_L 





1 





l 


1 1 


1 


_L 


1 


_L 



x v G nw y 


x\y 


1 _L 





1 J_ 


1 


1 1 1 


_L 


_L 1 _L 



x V Sch y 


x\y 


1 


_L 





1 


_L 


1 


1 1 


_L 


_L 


_L _L 


_L 



They can be equivalently described by syntactical manipulations in (TL|TL). 
The reduction rules are as follows: 



9 



(a | b) A S ac (c\d) = {abed V abd C V cdb G \b V d) 

n n 

(a 1 6) Agnw (c|rf) = (abcd\a d V c d V a&cd) 
(a 1 6) Asch (c|d) = (a6cd|&d) 

~ (a|6) = (a C |6) ( 6 ) 
(a\b) V S ac (c|d) = {ab V cd|6 V d) 
(a 1 6) V G nw (c|d) = (06 V cd\ab V cd V fed) 
V Sc h (c|d) = (afe V cd\bd). 

The first is based on the principle "if any of the arguments becomes defined, 



act!". A good example would be a quotation from [ Cal97[| : 



"One of the most dramatic examples of the unrecognized use 
of compound conditioning was the first military strategy of our 
nation. As the Colonialists waited for the British to attack, 
the signal was 'One if by land and two if by sea'. This is the 
conjunction of two conditionals with uncertainty/'^ 

Of course, if the above was understood as a conjunction of two conditionals, 
the situation was crying for the use of AsaC) whose definition has been 
proposed independently by Schay, Adams and Calabrese (the author of the 
quotation). 

The conjunction Agnw represents a moderate approach, which in case of an 
apparent evidence for reports 0, but otherwise it prefers to report unknown 
in a case of any doubt. Note that this conjunction is essentially the same as 
lazy evaluation, known from programming languages. 

Finally, the conjunction As c h is least defined, and acts (classically) only if 
both arguments become defined. It corresponds to the strict evaluation. 
We have given an example for the use of Asac- The uses of Agnw and As c h 
can be found in any computer program executed in parallel, which uses 
either lazy or strict evaluation of its logical conditions. And indeed both of 
them happily coexist in many programming languages, in that one of them 
is the standard choice, the programmer can however explicitly override the 
default and choose the other evaluation strategy. 

This seems to suggest that neither of the three choices discussed in this 
paragraph is the conjunction of conditionals. There are indeed many possible 
choices, and all of them have their own merits. And indeed already the 
original system of Schay consisted of five operations: ~ q, Asac> VsaCi ^Sch 
and Vsch- Moreover, he was aware that these operations still do not make 
the algebra functionally complete (even in the narrowed sense, restricted to 

1 NB, if the British had decided to attack from both directions, but not simultaneously, 
we would have probably discovered temporal conditionals much earlier. 
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defining only operations which are undefined for all undefined arguments). 
And in order to remedy this he suggested to use one of several additional 
operators, one of them being Aqnw' So for him all those operations could 
coexist in one system. 



Present tense re-conditioning Calabrese [pa!90 | and Goodman, Nguyen 
and Walker [GN95] proposed their own extensions of the conditioning oper- 
ator to 3, hence making it available for re-conditioning in SAC and GNW, 
respectively. The definitions are 



x\sacv 


x\y 


1 


_L 





_L _L 


_L 


1 


1 


_L 


_L 


1 


_L 



x\gnwV 


x\y 


1 


_L 





_L _L 


_L 


1 


1 


_L 


_L 


_L 


_L 



3.2 Past tense connectives 

Now we consider connectives, whose definitions refer to the strict past of 
their arguments. We continue to consider conjunction, which we use as a 
kind of model example. 



Examples of past tense connectives The following are connectives very 
close to the conjunction and disjunction of the product space cea introduced 
in | Goo94f p[ defined by the rule: the conjunction is defined and true iff 
both of its arguments have been defined, and moreover the historically first 
values of its two arguments have been both 1. Otherwise it is defined and 
false. Disjunction is defined similarly. They use the "Russian roulette" 
approach to repeating experiments. 

In the language of (TL|TL) (a\b) Aps (c\d) can be expressed by 



♦ (a Ab A 



A 



♦ (c A d A 



'♦*)) 
♦ <*)) 



true 



(7) 



and the definition of and (a\b) Vps (c\d) is similar. They seem complicated, 
but can be simplified, what we do below, and the Moore machine represen- 
tations are again much simpler and easier to analyze. 

To simplify the (TL|TL) representation above, let us define first(o|6) to be 



first (a| 6) := (♦(» A b A -> • ♦ b))\true). 
2 See the discussion on embedding cea's in our model below. 



(8) 
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It is convenient to denote by frrst2 (a 1 6) the first argument of first (a\b). 
Then the conjunction can be easily and effectively described as (first2(a|fr) A 
first2 (c\d) | true) (where A is the classical conjunction of temporal logic). 
The minimal Moore machine of first(a|6) is depicted in Fig. ^ below. 

4 Embedding of existing cea's and their incom- 
pleteness 

In this section we want to discuss the problem of embedding existing cea's 
into our model, and on that basis, the problem of defining natural connec- 
tives among conditionals in general. 

4.1 Syntax 

We assume the following syntax of the flat conditional expressions. The set 
of all such expressions will be denoted £ . 

The set of these expressions is the smallest set, containing all simple condi- 
tionals of the form (x\y), where x,y € fi, and closed under two-ary (infix) 
operations A, V and one unary prefix operation ~. 

If we require the closure under one additional binary operation (-|-) (which 
shouldn't be mixed up with the parenthesis-bar-parenthesis construction 
appearing in simple conditionals), we obtain the set of full conditional ex- 
pressions, denoted £J . 

4.2 Conditional event algebras 

According to |GMN97[| , a conditional event algebra (cea in short) over a 
probability space (0, 9Jt, Pr) is a space (but not necessarily a probability 
space) (fi , Wl , Pr Q ), extending (Q,9Jl, Pr), together with a function (-|-) : 
SDT x Wl -» Wl such that 

• Wl is an algebra of the signature of boolean algebras. 

• (a 1 6) = (an b\b) for all a, b € SDT. 

• The subalgebra of (0 o; 9Jt , Pr G ) consisting of the elements (a|Q) is 
isomorphic to (0, 97t, Pr) under the bijection a (a|f2). 

• Pr ((a|6)) = Pr(a n b)j Pr(6) for a, b G 9Jt, Pr(6) > 0. 

• Certain equalities hold among the Pr -probabilities. 

• The Pr -probabilities for n, U and ^ of elements of the form (aj|6j) for 
a{,bi £ 9JI are effectively computable from the set of Pr-probabilities 
of all the boolean combinations of the elements dj, b{. 



abed 



Figure 1: Moore machine of {{(a\b) A (c\d))) ps 
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How we understand cea's. It is readily seen, that any particular cea 
over any (f2, Pr) can be equivalently considered as a mapping assigning 
elements of SC7t and probabilities to flat conditional expressions. In this 
sense, cea is a kind of a model of a logic, whose syntax are the flat conditional 
expressions. This is the way we understand cea's and this is the level on 
which we will criticize them. 



Probabilistic models and cea's. First of all, we do not think that the 
algebraic structure of a cea is particularly important. The language of flat 
conditional expressions does not have equality, so what is really crucial are 
the probabilities. The algebraic structure can indeed be an obstacle while 
assigning probabilities (this is perhaps why the authors of the earlier papers 
devoted so much attention to it), but otherwise we are not so much interested 
in it. 

Next, what a cea assigns to a conditional expression is definitely too little. 
Apart from an element of SDTo and probability, an experiment should be de- 
termined to verify experimentally the value of the probability. This means, 
that the objects assigned to conditional expressions should be events in a 
probabilistic space, i.e., the triple (0 o , Wl Q , Pr G ) should be indeed a proba- 
bility space. Moreover, the experiments for simple conditionals (a\b) should 
correspond to the natural experiments one performs to learn conditional 
probabilities. In such experiments one can typically measure another proba- 
bilistic parameters, like, e.g., the probability that (a\b) is defined. We think 
such additional parameters should be assigned to compound conditional 
events, too. Of course, in the existing algebras we have hints concerning it, 
hidden in their universes and other details of the constructions, provided by 
the inventors, but the very definition of a cea does not require the additional 
parameters to be even defined, let alone to satisfy any reasonable properties. 
Strictly speaking, the signature of cea's is too small. In its present shape 
it permits existence of other, isomorphic algebras, where all the additional 
information is lost. 

Last, but not least, in the applications of classical probability theory one of- 
ten encounters problems in modeling, typically of the following form: one has 
an event, whose meaning is completely clear (it is known, when it happens 
and when it doesn't), but there is a problem of specifying the probability 
space structure, and sometimes different choices lead to different values for 
the probability of the same event. In such circumstances one can only experi- 
mentally decide which of the models is the correct one. A standard example 
is the difference between the so called statistics of: Maxwell-Boltzmann, 
Bose-Einstein and Fermi-Dirac, considered in quantum physics, and the un- 
successful search for any elementary particle, which would satisfy the first 
statistics, seemingly the most natural one among them (therefore we have 
bosons and fermions, but we do not have maxwellons in physics) Fel68]. 
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The tremendous success of probability theory in applications seems to sug- 
gest this is the right way of creating a mathematical model of a real life 
situation. However, the cea offers us another challenge. Even after coming 
up with the proper model of probability space of unconditional events, one 
still has a lot of work to choose the right model for conditional events. 
In plain words, it means that the structure modeling conditional events over 
a given probabilistic space of unconditional events, should be functorial: 
given the former space, the space of conditional events ought to be uniquely 
determined. Here the cea's again fall short of satisfying this requirement, 
because there are many known cea's, and each of them has its own defini- 
tion of (a\b) A (c\d), with its own probability, and all of them derive their 
definitions from certain first principles. The answer is almost obvious — the 
signature is too small, and should consist of many different conjunctions, 
disjunctions and negations, and perhaps lots of other connectives, which do 
not have any natural counterparts in the nonconditional world. 



(TL|TL) as a solution. We believe that our system of temporal condi- 
tional events addresses all of the problems we have indicated above. 
Our conditional events belong to a normal probability space. They are 
indeed stochastic processes — projections of Markov chains, and all their 
probabilistic properties (many more than just the bare probability) can be 
verified experimentally, 

To the contrary, all the cea's we call present tense in this paper have a natural 
representation as algebras of 3-valued indicator functions |GNW91 , Chapter 
3] . Recalling that every event from £ can be equivalently characterized by its 

2- valued indicator random variable, which is nothing but the characteristic 
function, we should naturally expect that the lifting of Pr to the space of 

3- valued indicator functions, should consist of 3-valued random variables, 
while the definition of a cea requires just elements of a strange algebra 
9JT and numbers. In case of our temporal conditionals, we naturally expect 
conditionals to be functions — > 3°°, and hence the lifting of Pr to consist 
of random functions, which are nothing but stochastic processes, the choice 
we actually have made. This is the natural pattern one should follow, and 
this is where the scalability of the model is hidden. E.g., nothing could 
prevent us from considering continuous time stochastic processes as models 
of conditional events, if need be. 

The space of conditional events is uniquely determined by the probabilistic 
space of nonconditional events, and has many more natural connectives than 
just three. In fact, we are not very original here: already Schay [3ch65] 
proposed a system with five connectives, and considered adding even more 
of them. 

Finally, to make our argument complete, we consider the major cea's below, 
and show that they embed in our models, in many different ways. For 
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the product space cea, we employ the embeddings to demonstrate that this 
cea has some further defects. The existence of multiple embeddings shows 
that the structure of a cea's isn't functorial just because there are many 
known cea's, but because the structure of a cea does not prescribe the way 
the conditionals relate to nonconditional events. Additionally, the product 
space cea is unable to characterize independence of conditional events by 
means of equalities of probabilities. 



4.3 Present tense cea's 

In our framework, there is a distinctive class of cea's, which we call present 
tense cea 's. The definitions of their connectives refer to the present of the 
process, only, hence the name. Consequently, such a connective applied to 
simple conditionals yields simple conditionals, again. Among such systems 
are SAC, proposed independently by Schay |]Sch6£ ] , Adams | Ada86| and 



Calabrese [Cal87] (who extended it by an operator for re-conditioning), and 
GNW proposed by Goodman |Goo87|1 and Goodman, Nguyen and Walker 
[ GNW9l| ] (later Goodman and Nguyen [ GN95 ] proposed a re-conditioning 



operator for GNW). They assume that all boolean combinations of simple 
conditional expressions yield simple conditional expressions again. The def- 



initions of their connectives are given in Section |3.1| . They do form cea's. 
Equivalently, |GNW9l| , Chapter 3], these algebras can be characterized as 
algebras of 3- valued indicator functions, and their connectives are then char- 
acterized by mappings from certain Cartesian power of 3 into 3. 
We take the second point of view, and define their semantics as follows. 
Our definition assigns to every conditional expression e a 3-valued indica- 
tor function ((e))sAC and ((e))cNW) respectively. For us, indicator functions 
are nothing else than present tense conditionals. (Originally these algebras 
do not involve time.) So, we give the definition by describing translations 
((•))sAC) (('))gnw : £ - » 6. We use in the translations present tense connec- 
tives of conditionals defined in (^). 
The definition of SAC: 



«H&)»SAC = (a\b), 
((e A e')) S AC = ((e))sAC A S ac ((e'))sAC, 
«e V e'))sAC = ((e))sAC V S ac ((e'))sAC, (9) 

((~e))sAC =~o((e))sAC, 
(((e|e')))sAC = (((e))sAc|sAc((e'))sAc). 
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The definition of GNW: 

(((#)))gnw = (a\b), 

((e A e'))cNW = ((e)) gnw A G nw ((e'))GNW, 
((e V e')) G NW = ((e)) gnw V G nw ((e'))cNW, (10) 
((~e)) G NW =~oWgnw> 

((( e l e ')))GNW = (((e))GNw|GNw((e'))cNw)- 

Given a probability space (O, CP(O), Pr), the probability of a conditional 
expression e is Pr S Ac(e) = Pr(((e))sAC = l)/Pr(((e))sAC / -L), and similarly 
PfGNw(e) = Pr(((e)) G NW = l)/Pr(((e)) G NW ¥= provided that the denom- 
inators are nonzero. 

All theses systems are readily seen to embed in our system of conditionals. 
In fact, if one represents them in the form of reduction rules, as in @, they 
do even embed syntactically in the (TL|TL) logic. They are present tense 
because they do not contain temporal connectives. 



4.4 Product space cea 

However, there is another cea, called the product space cea, which is not 
present tense. In order to analyze it and show that it can be interpreted in 
our model, we have to give the definition. 
The semantics is as follows: 

Beginning with (0, 971, Pr), we form its countable power f2°° endowed with 
the product measure. The cylinder b x • • • x b xa x x O x ■ ■ • C for 

3 

a, b € £ is denoted b 3 x a x Ct. 

Define the semantics function ((-))ps : PS — > J^fi 00 ) by 



«(o|ps&)))ps = U^\ 6 ) J x(&ria)xO, 

i=0 

((e A e'))ps = ((e)) PS n ((e'))p S , 
((eVe / ))ps = ((e))psU((e'))ps, 
((~e))ps = fi 00 \((e))p S . 

Wl of the product space cea is then the subalgebra of the (boolean) algebra 
CP(Q°°), U, n, \ •)), generated by all elements V xaxCl where a,kE, 
and Pr Q is the product measure. 



There are indeed two versions of PS: one defined in the paper [Goo94], where 
equality of two conditionals is understood as true equality of sets, and an- 
other, defined in |GMN97 1 , where the equality of conditionals is understood 
as equality almost everywhere, i.e., two conditional events of PS are equal 



17 



iff their symmetric difference has probability 0. The latter is therefore not 
logical, since it depends on the particular probability space structure. 
The probabilities assigned to the elements of PS are those according to the 
infinite product of Pr . 

4.5 First embedding 

In order to construct the first embedding of PS into S by defining two 
operations cr(-) : £ — > TL and r(-) : £ — > (TL|TL) as follows: 

a({a\b)) = first 2 (a|6) 
a(e A e ) = a(e) A cr(e') 

a{e V e') = <r(e) V a(e') (11) 
a(~e) = _| cr(e) 
r(e) = (<r(e)|trae). 

r(e) (or, more formally, the conditional from C represented by the former) 
is the desired embedding. 

Lemma 6. For every expression e G £ 

1. For every word w £ holds (r(e))oo(ti;) £ 2°°; 

2. Suppose (ai|6i), . . . , (a n |6 n ) are all simple conditionals occurring in e. 
Suppose w € f2°° is so that bi t , . . . , bi k are all events among b± , . . . , b n 
which happen in the sequence w, and all of them happen not later than 
at time m. Then of the word (T(e)) OCJ (w) is constant beginning since 
time m. 

3- (( e ))ps = {w £ Q°° I (T(e)) oc (w) is eventually constant 1}; 
4. Pr D (e) = Pr(r(e)); 

Proof. The proof of 1., 2. and 3. goes by simultaneous induction w.r.t. e. 
For e = (a\b) they follows from a simple analysis of the definition of ((e))ps 
and first2(a|6). 

Now consider e = ->e' and assume by induction that 1., 2. and 3. hold for 
e'. Since er(^e) = -icr(e'), we have 1. and 2. immediately. 
Moreover, (^a(e')\true) 00 (w) is eventually constant 1 iff (a(e')\true) 00 (w ) is 
eventually constant 0, which by 1. and 2. for e' is equivalent to the fact that 
{a{e')\true) OQ {w) is not eventually constant 1. This concludes the induction 
step of 3. 

Induction steps for the other connectives are equally simple. 

We turn now to 4. Suppose (ai |&i), • • • , (a n \b n ) are all simple conditionals 

occurring in a conditional expression ip. W.l.o.g. assume Pr(6j) > for 
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i = 1, . . . , k and Pr(6j) = for i = k + 1, . . . , n. (We permit k = and 
k = n, in which cases either all b{ are impossible, or all of them have positive 
probability.) 

Represent Q°° as a disjoint union of sets 



A — \ w £ / • • • ! bk happen in w and the first time , 
n when they all have already happened is n 

and the set Aoo := {w G f2°° / not all of b\, . . . , happen in w}. All these 
sets are clearly measurable. 

It is not hard to see that the (product) probability of Aoo is 0, so Pr G (e) = 
neN Pr(Ai)- 

It is not hard to verify, either, that 



lim V Pr n (A m ) = 1. 

n— >oo £ — * 

These two equalities imply 4. immediately, since for every n 

^n(A m ) < Pr„(r(e)) < 1 - Pr n (A 

mgN m£N 
AmC((e» PS A m n({e))p S =0 

The following theorem follows now instantly. 



□ 



Theorem 7. r(-) is an embedding of the PS cea mio C, m i/ie sense that 
for any underlying probability space, and any conditional expressions e,e', 
r(e) = r(e') ((e)) PS = ((e'»PS, and Pr(r(e)) = Pr c (e). ^ □ 

4.6 Reverse embeddings 

Definition 8. The reverse of a word u> = u\ ■■■0J n G denoted w R , is 
Now consider a conditional c G C. Then c R G C is a conditional defined by 



c R ( w ) := c(w R ). 

The class of languages definable in TL is reverse-closed |Eme9Cf| , i.e., if 
L = {w I w, \w\ \= if} for some G TL, then L R = {w R / w G L} = 
{w I w, \w\ \= for somey? G TL. It follows that c R is indeed a conditional 
in our sense. 
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Theorem 9. For every * G 3 

Pr([c] n =*)=Pr([c R ] n =*). 
Consequently, Pr G (c) = Pr(c R ). 

Proof. Pr is understood here as a product measure, which is insensitive to 
the order of its coordinates. □ 

It follows that any cea, which can be at all isomorphically embedded in our 
stochastic process model, has at least two embeddings, which are reverses 
of each other. The only exception is when the embedding is invariant under 
reverse, i.e., when each conditional c in the image of the embedding satisfies 
c(w) = c(w R ) for all w € However, it seems unlikely that any reasonable 
embedding has this property. In particular, the natural embeddings of the 
cea's we consider here are not of this kind. 

As a matter of example, we consider here PS. For the PS conjunction, its 
informal description of its reverse representation in C is that it is always 
defined and true iff the most recent defined values of its both arguments 
were 1. 

In (TL|TL), we have that T R ((a\b) A (c\d)) (the reverse of the embedding 
t(-) defined in fll!]) ) is defined by (($ Since(a A b)) A (dP Since(c A d))\true), 
whose Moore machine is depicted below. 

The precise definition of the reverse embedding of PS is as follows: first, 
we take the original conditional expression e = e((di|&i), . . . , (a m \b m )) and 
replace every (aj|foj) occurring in it by 6^Since(oj A obtaining e' G TL, 
and then define T R (e) := (e'\true). Formally: 

a R ((a\b)) = 6 C Since(a A 6) 
(j R (eAe')=(r R (e)A ff R (e') 

ff R (eVe') = (j R (e)V(j R (e') (12) 
cr R (~ e ) = ^cr R (e) 
r R (e) = (a R (e)\true). 

For this particular embedding, we have the following consequence of Theo- 
rem ^ (and the simple fact that reversing is an automorphism of the whole 
(TL|TL)) 

Theorem 10. r R (-) is an embedding of the PS cea into C, in the sense that 
for any underlying probability space, and any conditional expressions e,e', 
r R (e) = r R (e') iff «e» PS = <(e'» PS , and Pr(r R (e)) = Pr (e). □ 
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4.7 Sparse reverse embedding 

We give a new, radically different interpretation of PS in C. The main dif- 
ference is that it is not an embedding. We are going to present a way to 
interpret PS expressions in C so that, for any probability space (Q, y(f2), Pr), 
the Pr -probability of a PS-expression is equal to the asymptotic probabil- 
ity of its interpretation. However, expressions which yield equal element of 
the PS cea, may well give distinct conditional events in C, although, as said 
before, these expressions will have equal asymptotic probabilities. 
First of all , we redefine the meaning of simple conditionals (a\b). According 
to the new embedding, they are represented by (TL|TL) formula 

(a A -.(■-.& V true)\b V -•♦&). 
Denote this formula by (a\sb). 

It is essentially the simple conditional (a\b) G (TL|TL), except that it is 
defined and false until b becomes true for the very first time, and since then 
behaves exactly like (a\b) does in (TL|TL). The manipulation is necessary 
to accommodate the PS principle, that degenerate simple conditionals, like 
(a|0), do have probability, and that it is 0. 

We now define the sparse reverse interpretation of PS in C as follows: First, 
we take the original conditional expression e = e((ai|&i), . . . , (a m |6 m )) and 
set r RS (e) = (cr R (e)| V™ iOi V -. ♦ fy)), where a R (-) has been defined in (|l|). 
The conjunction and disjunction are defined precisely when at least one of 
the arguments is defined, so they resemble the connectives of SAC in this 
respect, but instead of assigning the other arguments default values when 
they are undefined, like SAC does, their most recent defined values are 
always used, instead. Here is an example Moore machine, in which we use 
e-moves. The graphical representation in Fig. || and Fig. || shows that the 
new operation is closely related to the reversed product space conjunction, 
as can be expected from the shape of the (TL|TL) representation. 

Lemma 11. If Pr(6j) = for at least one 1 < i < m, then Pr(r RS (e)) = 
Pr(r(e)). 

Proof. In this case, assuming Pr(6j) = 0, we have that VI^i(^i V is 
true with probability 1, hence [T RS (e)J = [r R (e)J with probability 1. Now 
the thesis follows immediately from Theorem |l0[ □ 

Lemma 12. IfPr(bi) > for all 1 < i < m, then Pr(r RS (e)) = Pr (r(e)). 

Proof Let e = e((oi|6i, . . . , (a n \b n )). We prove Pr(r R (e)) = Pr(r RS (e)), 

which is, by Theorem [l(], equivalent to what we have to show. 

Denote t = the first moment m when all the 6j's have already been defined. 



22 




Figure 4: Moore machine of r RS ((a|6) A (c\d)), part 1. This part of the 
machine is the transient part of the Markov chain, when Pr(6),Pr(d) > 



(Lemma 12) and the whole reachable part when at least one of these 
probabilities is (Lemma p]). 

Subscripts of the state labels indicate the most recent value of (a\b) and 
(c\d), respectively. 
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Figure 5: Moore machine of r RS ((a|6) A (c\d)), part 2. This part of the 
machine is the (only) ergodic class of the Markov chain, when Pr(6), Pr(d) > 
(Lemma |l2|) , and is unreachable when at least one of these probabilities 



is (Lemma 11). 

Subscripts of the state labels indicate the most recent value of (a\b) and 
(c\d), respectively. 
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Ooi 


b^cd 
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Ooo 
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0±i 
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0±o 


Oio 
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Ooo 





Figure 6: Moore machine of t ((a\b) A (c\d)), part 3. This part of the 
machine is the table of the transitions from the "transient" part (Fig. to 
the "ergodic" part (Fig. ||). 

Let us note that, if t < n, then the event [T RS (e)] n = _L is independent 
of the whole history of |/r RS (e)] up to time n — 1. This is so because the 
decision whether [T RS (e)]„ is defined or not depends solely on the present 
time values of b^s, and their present time values become independent of the 
(strict) past, when the condition ViLi ^ ^ k becomes for the first time false, 
because it remains then false forever, and the "given" part of r RS (e) does 
not contain any other time modalities. 

Moreover, whenever [r RS (e)](a;i . . . u) n ) ^ _L, then in fact r RS (e)(wi . . . uj n ) = 
T R (e)(toi . . . uj n ), which is clear from the syntactic representation of both 
conditional objects. 
Denote for convenience 

q = Pr([r RS (e)] n = ±\t < n) = l-Pr(V™i k) < 1, as well as and c = r R (e) 
and c s = T RS (e). 

Fix e > 0. Let M be large enough to have Pr(i > M) < e. Let N be a large 
integer, and let n satisfy n — N > M. 
We have then by the independence 



N 

Pr([c] n = 1) > Pr ([ c V* = 1) Pr([c s ]„_ i+1 = X) . . . Pr([c s ]„ = X) 
i=0 

AT 

Pr([c] n = 0) > J] Pr([c S ] n _, = 0) Pr([c S ] n _ m = X) . . . Pr([c S ] n = X) 

i=0 

where the £ error terms are caused by the event that t > M. 

Because each of the Pr(. . . ) expressions above tends to a limit as n 

proaches infinity, we get 

N 

Pr(c) > V lim Pr([c S ] n _ 4 = 1) lim Pr([c S ]„_ m = X) ■ • ■ Pr([c S ] n = 

i=0 
N 

= V lim Pr([c S ] n = 1) lim Pr([c S ]„ = _L) ■ • ■ Pr([c S ] n = J.) - e 

* — • n^oo n— >oo 
i=0 

N 

= V lim Pr([c S ] n = 1)( lim Pr([c S ] n = X))* - £ 
i=0 
N 

= V lim Pr([c S ] n = l)(f -£ 
i=0 

1 - fl^ 

= ^L_ l im Pr([c s ] n = l)-£, 
and similarly 

l-a N 

1 - Pr(c) > — lim Pr(|c s ] n = 0) - e. 

1 — q n— »oo 

In the limit N — ► oo both inequalities become 

Pr(c) > — *— lim Pr([c s l„ = 1) - e 

L — q n^oo 

1 - Pr(c) > — !— lim Pr([c s l„ = 0) - e, 

1 — ^ n-^oo 

hence 
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Pr(c) > — !— lim Pr([c s ] n = 1) - e 

— lim Pr([c s l n = 0)+e 



1 




q 


1 










l 




1 




1 








1 




1 
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(1-g- lim Pr([c s ] n = 0))+e 

71— >00 

lim Pr([c s ] n = l) + e, 

n— >oo 

because 1 = Pr([c s ] n = 1) + Pr([c s ] n = 0) + Pr([c s ] n = _L), in which the 

last term is constant equal q. 

We took an arbitrary e > 0, therefore indeed 

Pr(c S ) = — !— lim Pr([c S l„ = 1), 

1 — q n^oo 

and likewise 



1 _ p r ( c S) = J_ i im Pr([c b J n = 0). 
1 — q - 



n— >oo 



From the last two equalities the equality Pr(c) = Pr(c s ) follows immediately. 

□ 

Summing up, 

Theorem 13. For every conditional expression e and every probability as- 
signment to the elements in CI, Pr Q (e) = Pr(r RS (e)). □ 

The very important consequence of the theorem is the following: 

Corollary 14. The formalism of PS cea ; seen as a logic of conditionals, 
is unable to determine certain probabilistic characteristics, other than the 
asymptotic probability, associated with stochastic processes. 

Proof. PS possesses two interpretations in C, one of which consists entirely of 
always defined temporal conditionals (the r(-) embedding), while the second 
contains conditionals which are defined with asymptotic probability strictly 
less than 1 (the t rs (-) interpretation). □ 

Another similar example of deficiencies of the PS cea can be found below, 
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Let us note that the t rs (-) interpretation does not preserve the algebraic 
structure of the PS cea, in general. Indeed, already (((0|a)))ps = (((0|6)))ps, 
while T RS ((0|a)) = (a\sb) and r RS ((0|&)) = (0|gfe) represent different condi- 
tional objects, for a / b, a, b £ £. 
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5 Advantages of (TL|TL) conditionals 
5.1 Complexity and proof systems 

The paper |Goo94|| asks for the proof systems for various three- valued logics, 
appearing in the context of the theory of conditionals. 
In order to discuss this issue, we use the machinery of complexity theory. 
All the necessary definitions can be found in [ HU79|] . 



As we prove below, for all of the major cea's, the sets of weak tautologies 
are coMP complete. 

In the light of the above results, there is a little hope for a practically 
useful proof system for the most prominent systems among GNW and SAC. 
Indeed, unless MP = coMP, a very unlikely complexity-theoretic collapse, 
for every sound and complete proof system for the two above logics, there 
must be weak tautologies of length n such that their shortest proofs are of 
super polynomial length w.r.t. n, for infinitely many n. 
Temporal logic is known to be yS3VlC£-complete, Theorem |l[ It is therefore 
obvious that the set of weak tautologies of (TL|TL), consisting of all expres- 
sions (tp\ip) such that for every w S 3 + , (if\tp)(w) € {1, _L} (equivalently: 
that ifi — » ip is a tautology of TL), is ^PSJVlCS-complete, too. 
Although it is commonly believed that coNT C ySTAGS., from practical 
standpoint both admit exponential time algorithms, and no better ones are 
known. Consequently, the practical algorithmic difference between cea's and 
(TL|TL) is not so crucial. The advantage of considering cea's as subsystems 
of (TL|TL) stems from the fact that a lot is known about proof systems for 
temporal logic — unlike for cea's. 

We are not interested in the complexity of cea's as logics involving terms 
(a\b) as atoms, but rather as 3-valued logics. To explain the difference, let 
us note that in the calculus of any of the cea's one can easily restrict atoms 
to be two-valued (e.g., by using only atoms of the form (a|l)), and thus all 
the complexity questions trivialize, the sets of tautologies of all the logics 
are coMP-complete. Here, we assume that the atoms can always assume all 
three logical values, and are effectively variables. Hence we indeed view our 
(weak) tautologies as a kind of (weak) meta-tautologies, i.e., formulas which 
evaluate to either 1 or _L, no matter what the arguments are. 
Formally, for this section we modify £ and £ replacing simple conditionals 
(a 1 6) by variables p±,p2, ■ ■ ■ in conditional expressions. 
Likewise, given a valuation v : {pi,P2, ...}—> 3, we let ((-))sac anci (('))gnw 
assign values in 3 to expressions in £ ' . These values are determined by the 
equations in ([|) and ([To|) . 

An expression e is a weak tautology of SAC cea (GNW cea, respectively) iff 
(( e ))sAC + ((( e ))cNW ^ °> respectively) for every v. 

e is a strong tautology of SAC (GNW, respectively), iff the above values are 
1 for every v. 
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We consider the complexity problem of determining if an expression e is a 
weak tautology according to each of the considered cea's, considering also 
some syntactical restrictions put on the syntactical shape of e. 
We do not consider strong tautologies, which is explained by the following. 

Proposition 15. In GNW and SAC there are no strong tautologies. 

Proof. All connectives have value _L if all their arguments are _L, for both 
SAC and GNW. □ 

Occasionally, we want to consider £' as the syntax of classical logic. In this 
case, given a valuation v : {pi,P2, ■ ■ ■ } —> 2, the value ((e)) CL G 2 is computed 
according to the classical rules, where the conditioning | is understood as 
reverse implication: (a\b) is a <— b. 

Pure conditional parts. We consider here pure conditional fragments of 
SAC and GNW, i.e., expressions in which the only connective used is |. 
It shows that unlimited use of re-conditioning leads to coNT-completeness 
of the weak tautology problem. 

Theorem 16. It is an coNT -complete problem to determine if an expression 
e € &' involving only re- conditioning is a weak tautology o/SAC. 
It is an coN?- complete problem to determine if an expression eft' involv- 
ing only re- conditioning is a weak tautology of GNW. 

Proof. It is obvious that the sets of weak tautologies in both cases are in 
co3MTP. So it remains to prove their hardness in this complexity class. 
It is easily seen that the (re-)conditioning operators of both GNW and SAC 
satisfy the following property: the equivalence relation ~ on 3 identifying 1 
with _L is a congruence of 21 = (3, |sac) and 25 = (3, |gnw)j an d the quotient 
algebras both 21 = (3, |sac)/~ an d 25 = (3, |gnw)/~ are isomorphic to the 
2-element algebra with the reversed classical implication (2, <—}. The natural 
epimorphism 7721 : 21 — > (2, <— ) sends 1 and _L to 1, and to 0, and the 
definition of 7703 is identical. 

Therefore a pure conditional expression is a weak tautology of either of the 
considered cea's iff it is a classical tautology, after its (re-)conditioning oper- 
ator is replaced by the reversed classical implication. The classical formula 
resulting from this replacement is denoted e. 

We have to prove that e is not a weak tautology iff e is not a tautology. 
Let v be any valuation of the variables of e in 3. Now we use the natural 
epimorphism 7]<% and get 

(Wr = w(«e»§Ac)- 

So if one of the values above can be 0, the other can be, as well, which 
establishes the desired equivalence. 
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Since it is known that the tautologies of the classical propositional logic of 
pure implication are coNIP complete [p3eu9E ], the claim follows. □ 



As a by-product we have 

Corollary 17. The sets of pure conditional weak tautologies of SAC and 
GNW are identical. □ 



Flat parts. Here we consider SAC and GNW without re-conditioning. 
We can define the following 3slTP-complete problem 3CNF-SAT. 
Given: an expression e £ £ of the following syntactical form: 



e = {hi V £ 12 V £13) A (£21 V £22 V £ 23 ) A • • • A (4i V £ s2 V £ s3 ), (13) 
where each of the £ij is either pj or 

The NlP-complete problem is: given e of the above shape, determine if e is 
satisfiable, i.e, if there exists v such that ((e)) CL = 1. 

It follows that it is coN^-complete to determine, given e as above, if e is not 
satisfiable, i.e., whether (( 

^^))cl — ^ every v. 
In order to prove coN ^-completeness of the sets of weak tautologies of either 
of the cea's, we have to construct a polynomial time computable transfor- 
mation e 1— > e translating e of the form (^) into e of the form conforming 
to the restriction set in the respective theorem, and such that ~ e is not 
satisfiable in the classical sense iff e is a weak tautology of the respective 
logic. 

Theorem 18. It is an coMP- complete problem to determine if an expression 
e € £ is a weak tautology of SAC. 

It is an co!MP- complete problem to determine if an expression e £ L is a 
weak tautology of GNW. 

Proof. It is easily seen that the connectives Agnw an d Vqnw satisfy again 
the property that the equivalence relation ~ on 3 identifying 1 with _L is a 
congruence of the algebra with the above functions, and the quotient algebra 
(3, AqnW) Vgnw) / ~ is isomorphic to the classical (2, A, V). This fails about 
the negation, however. 

As the negation is applied to atoms only in 3CNF-SAT, we do not have 
to use the negation of GNW directly. Instead, we introduce new variables 
to denote the negations, and force them to behave correctly outside of the 
translation of e. 

Formally, let the mapping ei-»e' from the classical propositional logic into 
GNW be defined by replacing unnegated atoms p in e by p and negated 
atoms ~p by p. Concerning binary connectives, we leave A and V untouched. 

Then let e be defined as ~ (e' A /\(j5 V p) A (~j5V where p in the big 

p 

conjunction ranges over all propositional variables of e. 
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Certainly the mapping e i— > e is computable in polynomial time. In order to 
show the coNJ 1 completeness of the set of tautologies of GNW, it suffices to 
show two implications: 

• if e is satisfiable classically, then e is not a weak tautology of GNW. 

• if then e is not a weak tautology of GNW, then e is satisfiable classi- 
cally. 

For the first item, assume that e is satisfiable, i.e., there is an assignment v 
of 0's and l's to the propositional variables of e which makes e into 1. We 
construct a 3- valued assignment w which makes e into 0. If v(p) = 1, we let 
w(p) = 1 and w(p) = 0. If v(p) = 0, we let w(p) = and w(p) = 1. In e' each 
variable has under w exactly the value of the corresponding literal in e has 
under v. So e' evaluates to 1, because connectives in GNW behave classically 
for classical arguments. In addition, each of the formulas (pVp) A (~pV ~p) 
evaluates to 1 , so altogether e evaluates to the ~ o-negation of the value to 
which e' does evaluate, which is 0, as desired. 

For the second item, assume there is an assignment w of 0's, l's and _L's to 
the propositional variables of e which makes it 0. It follows that each of the 
terms (pVp) A (~pV ~j5) must evaluate to 1 under w. Therefore of each pair 
p,p, one variable must be assigned 1 and the other by w, which can be 
checked by simple inspection of all possibilities. Moreover, e' must evaluate 
to 1 under w, which is indeed 2-valued, by the previous observation. The 
connectives of GNW act classically for classical arguments, therefore e is 
indeed classically satisfiable, by the valuation dijjh w{p). 
This finishes the proof. □ 



Theorem 19. The weak flat- conditional SAC is coNTF '-complete. 

Proof. We are going to use the same proof idea as before. However, we have 
a small problem. The conjunction of SAC does not permit us to deduce, 
that if a conjunction of two formulas evaluates to 1, so does each of the 
components. 

So instead of the original conjunction, we have to use some custom connec- 
tive defined from the conjunction, disjunction and negation, which will act 
as a "good" conjunction, for which the inference does hold. It turn out, 
that the conjunction of GNW is not definable in SAC, but there is another 
connective we can use instead, and which is definable (we discuss the defin- 
ability of connectives in SAC and GNW in another paper [ CWTHROlfl ) . Its 
definition is as follows: 



x n y = [x V (y A (xV ~y))] A [y V (x A (yV 
It is not difficult (but tedious) to check, that ((x n y))sAC has truth table 
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x l~l y 



x\y 





1 


_L 














1 





1 





_L 








_L 



which is exactly what we need for our purposes. The only subtle point is 
that our x U y is substantially longer than |x| + \y\. Indeed it is about 4 
times longer. We do replace A by n in very long conjunctions. However, 
if we represent this long conjunction as a balanced binary (parse) tree, i.e., 
insert brackets to obtain the structure 



(((...□...)□(.. .□...))□((.. .□...)□(.. .□...))), 

the depth of nesting of conjunctions is at most log base 2 of the number 
N of clauses in the conjunction, and the total increase of length caused by 
the replacement is 4 de P th of ncstin s = 4 lo ^ N = N 2 . Altogether, the resulting 
formula, using n in place of A, is still of polynomial size, and can be easily 
constructed in polynomial time, as needed. □ 



5.2 Independence of conditional events 

There has been a considerable amount of interest in the independence issue 
for conditional events, reflected in the cea literature jGoo94| , |Cal97| , |Pea88|1 . 



The problem is that typically even for a and b mutually independent of c 
and d one does not have Pr((a|6) A (c\d)) = Pr((a\b)) Pr((c|(f)). The only ex- 
ception is PS, where this equality holds. The other variant of independence: 
Pr((a|6) | (c|g?)) = Pr((a|6)) is undefined in some formalisms, due to the lack 
of re-conditioning operator, and fails in others. However, note that in the 
cea framework one cannot obtain any proper characterization of indepen- 
dence, because there is no underlying probabilistic semantics, in which one 
could say which pairs of conditionals are independent and which aren't, and 
then attempt to characterize this by equalities among probabilities. One 
feels that (a\b) and (c\d) should be independent for mutually independent 
arguments, but this is not more than a feeling, and there is no idea there 
what might make two conditionals independent when their arguments are 
not mutually independent, or when they are composite. 
We can address this problem in our semantical setting. First of all, for a and 
b mutually independent of c and d, the stochastic processes [(a|6)] and [(c|d)] 
are obviously independent. And of course, the independence of the stochastic 
processes is what the independence of conditionals should be. This remains 
true, no matter which cea we consider. It is, however, a different story if this 
independence can be formally characterized in terms of equalities between 
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probabilities of conditionals in the cea under consideration. It appears that 
in the pure cea formalism this cannot be achieved, because in Theorem ^ 
below we show that independence is undefinable in the PS cea. 
To be precise, the full independence of stochastic processes X, y means that 
the full histories of both processes are independent, which is different from 
the much less restrictive requirement that just the present time values should 
be independent. The first version is formalized by the requirement that X+,t 
and y+,t are independent at any time t > 0, i.e., for any w\ . . . wt, v 1 . . . v t £ 
3* holds 



The weaker, present tense independence requires only that Xt and Yt are 
independent at any time t > 0, i.e., that for any w,v £ 3 holds Pr(Xt = 
w, Yt = v) = Vi(Xt = w) Pr(Yj = v). To see the difference it is worth noting 
that for any present tense (TL|TL) formula (<£>|V0 the processes [(</?|V')l an d 
[(• ip\ • ip)J are present tense independent, although of course they are easily 
seen to be dependent, unless the former is constant. 
But let us note the following simple fact. 

Lemma 20. If C\ and C2 are two present tense conditionals, they are inde- 
pendent iff they are present tense independent. □ 

We know now what independence should mean. It is another story how to 
characterize it in terms of the asymptotic probability of conditionals. 
First we prove the characterization for present tense independence at fixed 
time. 

Let ]{a\b) := (b\true). 

Lemma 21. Let n be a fixed time instant. The following are equivalent: 

• Random variables {(a\b)J n and [(c|d)] n are independent. 

• The following four equalities hold: 




Y t = vtJ 

PT(X 1 = w 1 ,...,X t = w t )PT(Y 1 = v 1 ,...,Y t = v t ). 



Pr n ((a|6) A Sch (c\d)) = Pr n ((a|6))Pr ft ((c|d)) 
Pr n ((o|6) A Sch Uc\d)) = Pr n ((o|6)) Pr n (T(c|d)) 
Pr n (|(a|&) A Sch (c\d)) = Pr„(T(a|6))Pr„((c|d)) 

Pr n (|(a|6) A T(c|0) = Prn(T(a|6)) Pr„(T(c|d)) 



(14) 
(15) 
(16) 
(17) 



where we assume an equation to hold in case when both sides are un- 
defined. 
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Proof. JJ- Independence of random variables [(«!&)]« and [(c|d)] n implies, in 
particular, that 



Pr([(a|6)]„ = 0, 1, l(c\d)] n = 0, 1) = Pr([(a|6)]„ = 0, 1) Pr([(c|d)]„ = 0, 1), 

(00 



which is exactly equivalent to (|17|), The other consequences of independence 
are equalities 



Pr([(a|6)] n = 1, [(c\d)] = 1) = Pr([(a|6)] n = 1) Pr([(c|d)] = 1) (gf) 
Pr(I(a|6)] n = 1, {(c\d)j n = 0, 1) = Pr([(a|6)] n = 1) Pr([(c|d)]„ = 0, 1) 

(00 

Pr(I(o|6)] n = 0, 1, [(c|d)]„ = 1) = Pr([(a|6)] n = 0, 1) Pr([(c|d)] n = 1), 

(00 

which, divided by (17'), yield fll4"|), (|l5| ) and (0), respectively. Note that if 
both sides of (17') are 0, then all the resulting equalities involve an undefined 
term on both sides, and hence hold, according to our convention. 

fT Let (|l7[) (i.e., (fl7|)) hold. If its both sides are 0, the random variables 
[(a|6)] n and [(c|d)] n are independent, because one of them is constant. So 
let us assume (|l7f) holds and its both sides are nonzero. In particular, each 



of the (14), p5| ) and ( JTq ) is defined on both sides, because the denominators 
are everywhere nonzero. Multiplying these equalities by (|l7]0> we S e ^ (00 > 
fll5f) and (|l6f), respectively. It is now a matter of routine to prove that the 
independence of [(a|6)] n and [(c|d)] n follows from (140) dUf) an d (HO an( ^ 

flnf). " □ 

The lemma allows us to characterize independence for present tense condi- 
tionals. 

Theorem 22. For present tense (a\b) and (c\d) the following are equivalent: 

• Stochastic processes l(a\b)J and {(c\d)\ are independent. 

• The equalities (|l4|) -([l7|) hold with Pr n replaced by Pr in each term, 
where we again assume an equation to hold in case when both sides 
are undefined. 

Proof. For present tense (a\b) the probability Pr n ((a|6)) is independent of 
n, and is (of course) equal to Pr((o|6)). Now Lemmas ^ and ^l] give us the 
desired equivalence. □ 
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The full characterization of independence for general temporal conditionals 
is not known at the moment. Most likely, if it at all exists, it must be 
nonuniform, in the sense that the number of equalities between probabilities 
depends in principle on the actual (f\ip) and (CIO- 

However, there is a quite general sufficient condition for independence, which 
can be (nonuniformly) characterized by equalities of asymptotic probability. 
Call two conditionals a, b strongly independent iff there exist stochastically 
independent Markov chains X and y and projections h,g such that [a] = 
h(X) and [6] = gty). 

Theorem 23. Strong independence of conditional events from (TL|TL) can 
be equivalently characterized by equations of asymptotic probability. 

We begin with 

Lemma 24. Let Markov chains X, y have n and m states, respectively. If 
X and y are independent until time mn + 1, they are fully independent, i.e., 
^f 



Pr 



f X\ =w\, . . . , X t = wt* 
K Yi = vi, Y t = v t 

Pr(X 1 =wi,...,X t = w t )Pi(Y 1 =v 1 ,...,Y t = v t ) (18) 



holds for all t < mn + 1 and all sequences wi, . . . , wt, v\, . . . , vt of states of 
X and y, respectively, then X and y are independent and ([H|) holds indeed 
for all t. 

Proof. First of all, observe that (X, y) = (Xi, Y\), (X%, Y2), . . . is a Markov 
chain, as well. 



Suppose that (18) fails and that the least t for which it fails is t > mn + 1 
(because for t < mn + 1 ( |l8| ) holds by assumption). 
The in-equality 

p (X\ = Wi, X t = w t \ , 

\Y 1 = v l , Y t = v t ) r 

Pr{X 1 =wi,...,X t = w t )Pi(Y 1 =v 1 ,...,Y t = v t ) (19) 
is by Markov property (§) for X, y and (X, y) equivalent to 



Pr 



Pi(Xt = 



■ ■ ■ , X t -i — 

y t -i = 

wt, ■ ■ -,X t -t = 
Pr(Y! = Vl ., 



wt-i 
v t -t 



Pr 



X t = w t X t -t = w t -i 
Y t = v t Y t _t = v t -i , 

w t -i) Pr(X t = w t \X t -t = w t -i) x 

. . , Y t -t = v t -t) Pr(Y t = vt\Y t -t = vt-i), 
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which in turn is equivalent to 



Pr 



f X t = w t 
K Y t = v t 



X t -i = w t -i\ , 
Y t -i = «t_i J * 

Pr(X t = w t \X t ^ = wt-i) Pr(y t = Wt^-i = vt-i), (20) 



because t is the least one for which in-equality holds, and so the non- 
conditional probabilities in the previous in-equality cancel out. 
Moreover, the canceling terms must be nonzero for the in-equality to hold, 
which means (wt-i,vt~i) is reachable with positive probability from the 
initial state in (X, y). But therefore it must be reachable with positive prob- 
ability in at most mn steps, because there are exactly so many states in 
(X, y). So let (xi,yi), i = 1, . . . ,s < mn be a sequence of states of (X, V) 
leading to (x s ,y s ) = (wt-i,v t -i) with positive probability. By assumption 



p /Xl = 2l, X s =x s \ = 

\Yi = y 1 , Y s = v s ) 

PT(X 1 = x 1 ,...,X s = x s )Pr(Y 1 = y 1 ,...,Y s = v s ), (21) 

because s < mn. If we now multiply the above by (p0[), we get, by a calcula- 
tion reverse to what we have done above, an instance of (|H]) with t < mn+1, 
a contradiction. □ 



Lemma 25. For given Markov chains X and y and for a fixed time t, fixed 
sequences w\, . . . , wt and vi, . . . , v t of states ofX and V, respectively, the for- 
can be equivalently characterized by equalities among asymptotic 
probabilities of certain conditionals, derived from X and}$. 

Proof. Let 21 and 03 be the deterministic finite automata, underlying X and 
y. For a state w of 21 let 21^ be the Moore machine resulting from 21 by 
labeling the state w with 1 and all the remaining states with 0. Since all 
Sl^'s are 2-valued, there exist TL formulas a w , which are true precisely when 
the last symbol of the output of 21™ is 1. Similarly we define Q3„ and (3 V . 
Now (|i~8|) is equivalent to 

t 

Pr(« truef\^9 + true A f\(m ~\a w . A /?„.))) = 

i=i 

t 

Pr(« true A -i • + true A /\(* '(a^))) X 

i=l 

t 

Pr(» frueAn« + true A A (• *(AJ)). 

i=i 
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Each of the TL formulas asserts that it has been once that there was some- 
thing t — 1 steps ago, but there was nothing t steps ago (so we have been at 
time t precisely), and we were in the prescribed states of the Markov chain 
in question at times: t, one step before that, ... , t — 1 steps before that. 

□ 

What remains to be seen is that we can indeed choose some canonical Markov 
chains X and y to represent a and b, which are independent whenever a and 
b are strongly independent. 

Let us recall, that any conditional event in our model is a projection of a 
Markov chain, derived from a Moore machine for the underlying conditional 
object. Since for every Moore machine there exists the minimal Moore 
machine computing the same function, in presence of probabilities, we thus 
always have the minimal Markov chain underlying any given conditional 
event. 

Lemma 26. Let a andb be strongly independent. Then the minimal Markov 
chains for a and b are independent. 

Proof. Let X and y be two independent Markov chains, underlying a and 
b. Applying the quotient construction to X and y we pass to the minimal 
Markov chains underlying a and b. The quotient construction is determinis- 
tic, and therefore it does not break independence (exactly like strong inde- 
pendence implies independence). It follows that the minimal Markov chains 
are independent, too. □ 

Proof of Theorem |^|. The conditional events a and b are strongly indepen- 
dent iff the minimal Markov chains underlying them are independent, by 



Lemma 2C. The latter can be expressed equivalently by (run + l) mn con- 
ditions of the form (|i~8| ) for minimal chains of m and n states, respectively, 
by Lemma ^4]. Each of these conditions in turn can be expressed equiva- 
lently by a single equality of asymptotic probabilities of certain conditional 
objects. This means that the strong independence of a and b can be equiv- 
alently characterized by a set of equalities among asymptotic probabilities 
of conditionals, which can be syntactically determined from a and b and do 
not depend on the probability space structure. □ 

Of course, for conditionals which are themselves Markov chains for any prob- 
ability assignment, strong independence is the same as independence. There- 
fore we have 

Corollary 27. For conditionals which are themselves Markov chains for 
any probability assignment, independence can be characterized by equalities 
of asymptotic probabilities. □ 
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The conditionals to which this applies can be recognized by the property that 
their minimal Moore machine has at most one state labeled by each element 
of 3 (and thus at most three states altogether). Present tense conditionals 
are of this kind, and thus we have an alternative proof of Theorem 22, which 
much less elegant set of equalities, however. But present tense conditionals 
do not exhaust all conditionals, which are Markov chains. An example is 
the conditional (aj ■((• a — > aP) A (• a? — > a) A (-> • true — > a))), analyzed 
in [TRH01]. Its minimal Moore machine is depicted below. 




Figure 7: Moore machine of (o| ■((• a —* a^)A(» o? — > a) A(-> • true — ► a))). 



Therefore Theorem 23 is indeed stronger than Theorem 22. 
Finally, we consider the question of PS cea, for which one might want a 
characterization of independence in terms of asymptotic probability. Here 
we give a negative answer. 

Theorem 28. There exist two conditional expressions e\ and e2 and a prob- 
ability space such that the embeddings r(ei) andr^) are independent, while 
their sparse reverse counterparts r RS (e) and r RS (e2) are not independent. 

Proof. Take e± = e2 = (0|a) and any probability space with < Pr(a) < 1. 
Then [r(ei)J and [t^)] are constant processes, equal to 0, so they are 
(trivially) independent. However, already 



>(Pr(([r RS ( ei )])H=0J_)) 2 

= Pr([r RS ( ei )])H = 0±) • Pr([r RS (e 2 )])H = 0J_), 

where the inequality holds because Pr([T RS (ei)])(z/;) = 0_L) = Pr(a)(l — 
Pr(a))^0,l. □ 
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Corollary 29. There is no characterization of independence in PS cea in 
terms of equalities of asymptotic probability. 

Proof. Because both r(-) and t rs (-) preserve all asymptotic probabilities of 
conditionals, both of them satisfy precisely the same equalities of asymptotic 
probabilities. So if there were a characterization of independence in terms 
of equalities of such probabilities, the interpretations of the two conditionals 
(a\0) and (a|0) above would have to be either independent in both cases, or 
dependent in both cases, while they are not, a contradiction. □ 

The consequence is that in the cea formalism is not expressive enough to de- 
fine independence of conditionals by means of equalities of asymptotic prob- 
abilities. Note however, that such a representation is certainly possible by 
means of equalities of probabilities and equalities of the algebraic structure. 
Indeed, PS cea is boolean algebra with respect to its connectives A, V,~ 
(as it is easily visible from its syntactic representation within (TL|TL)), 
and the equalities it satisfies enforce, that Pr is an ordinary probability 
measure. Therefore independence is equivalent to the standard equality 
Pr (((ei A 

^2))ps) — P^o (((ei))ps)Pr (((e2))ps)- What we have constructed 
are two non-boolean subsystems of (TL|TL), in which all the (asymptotic) 
probability assignments agree with those of Pr , and yet no set of equalities 
of probabilities can characterize the true probabilistic independence in both 
of them simultaneously. 



5.3 Algorithms 

Polynomial algorithm for PS cea. Let us see that our approach pro- 
vides a nontrivial improvements to the algorithmic status of existing cea's. 
We will demonstrate this by calculating the probabilities of conditional ex- 
pressions, according to PS cea, in time polynomial in their size and expo- 
nential in the number of variables. (Note that the number of arguments for 
computation of the probability of an n-ary conditional is 2", so the above 



indeed means computation polynomial in the size of the input.) In [Goo94] 
it is stated that the computation of the PS-probability of a conjunction of 
n conditionals (aj|6j), according to the method used in that paper, requires 
adding J2m=i m ' ' So(m,n) ■ (2 m+1 — 2) terms, each being a nonconditional 
probability of a conjunction of certain events <2j and 6j. The number of sum- 
mands, where So(m, n) are Stirling's number of the second kind, is of order 
2 nlogn . It is substantially more than about c2 n one obtains for the present 



tense cea's SAC and GNW, and has been stressed in ]GMN97 , p. 499] and in 



[ Fou l, since it strongly affects the usefulness of PS as a tool for applications. 
Using our approach we have instantly an algorithm to calculate the same 
probability in 2°^ steps. As a matter of fact, this applies to any condi- 
tional expression with n arguments (aj|6j), i = 1 . . . , n, as long as its length 
does not exceed 2°^ n \ All the complexity bounds given here assume unit 
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cost of basic arithmetical operations: addition, multiplication, subtraction 
and division. 

Theorem 30. There is an algorithm, computing the PS probability of an 
n-ary conditional expression of length m in time polynomial in max(m, 2 n ). 

Proof. The first step of the algorithm on input expression e is to construct 
the minimal Moore machine, computing r(e). 

Lemma 31. The minimal Moore machine of r(e) for n-ary conditional ex- 
pression e has at most 3™ states. 

Proof. The minimal Moore of first(a|6) has 3 states (see @ and Figure ||). 
By the definition of the r(-) embedding (Section |4.5| ), a Moore machine 
21 = (Q, Q, 5, h, qo) of r(e) can be constructed as follows: 
The set Q of states of 21 is the product Q\ x ■ ■ ■ x Q n of state sets of Moore 
machines 21, = (Qi,£t,6i,hi,qoi) of all expressions first (aj|6j) occurring in 
r(e). The transition function of 21 is defined coordinate-wise, i.e., 

8({qx,...,q n ),u) = (6i(qi,u),...,6 n (q n ,u)), 
the initial state is qo = (qoi, ■ ■ ■ , qo n ), and, crucially, 

h((qi, q n )) = c(hi(qi), h n (q n )), 

where c(hi(qi), . . . , h n (q n )) is the classical logic evaluation of the expression 
c on arguments hi(qi), . . . , h n (q n ) £ 2. 

This product construction is well-known for automata theory, and it is im- 
mediate that it does the work. □ 

So it is quite easy to construct, given e € £, the Moore machine of r(e). Now 
we have to turn this Moore machine into a Markov chain. Assuming that 
all the probabilities of atomic events from are given, we simply replace 
multiple transitions between the same states represented by the sum of their 
probabilities — a single number. 

Furthermore, the Markov chain we obtain is absorbing, i.e., it has one- 
element ergodic classes. It can be proven by a straightforward induction on 
n — the number of three element Moore machines we product. It follows 
jCS76l , Chapter III] that we can use the following method to compute the 
limiting probability that the chain finally arrives at a state labeled by 1. 
Clearly, in this situation we can collapse all absorbing states labeled 1 into 
a single such state. 

Denote by P the matrix (p(i,j)) of transition probabilities, by Q the sub- 
matrix of rows and columns corresponding to transient states, and by R 
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the submatrix of rows corresponding to transient states and columns corre- 
sponding to absorbing states. Let Id be a diagonal matrix with l's on the 
diagonal and 0's elsewhere. Let B = (Id — Q)~ l R. Then the probability 
we are looking for is the entry in B in the row corresponding to the initial 
state in in the column corresponding to the (only) absorbing state labeled 
by 1 in the Markov chain. Since all the calculations on matrices necessary 
to compute B are doable in time polynomial in the size of the matrices, the 
total computation time is (2 n )°^ = 2°( n \ as desired. □ 

6 Summary 

We have discussed the temporal calculus of conditional objects and condi- 
tional events (TL|TL) as a formalism alternative to conditional event alge- 
bras. 

We have shown that all the major conditional event algebras, including 
those of Schay-Adams-Calabrese, Goodman-Nguyen- Walker and the prod- 
uct space cea, embed isomorphically in (TL|TL). 
Moreover, (TL|TL) is superior to those formalisms in several ways: 

• It provides natural, probabilistic semantics of conditionals, allowing 
one to construct experiments to evaluate all their interesting proba- 
bilistic parameters, unlike cea's, which generally are not probability 
spaces, and which do not require certain probabilistic parameters to 
be defined at all. 

• The construction of (TL|TL) is functorial, in the sense, that the un- 
derlying probabilistic space of nonconditional events determines the 
space of temporal conditional events uniquely, while cea's generally 
are not unique. 

• The formalism of (TLjTL) allows one to define and analyze indepen- 
dence of conditional events, which is difficult or impossible in cea's. 

• (TL|TL) offers better algorithms for calculation of probabilities, than 
those known previously for cea's. 
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